ra 



PROCEEDINGS 

OF SCIENCE 



o 

O 
(N 

Oh- 
< 



o 

o 
\o 
o 

Oh! 






Top Background to SM Higgs Searches in the 
W + W~ -► £v^v Decay Mode at CMS 



G.Davatz 

Institute for Particle Physics, ETH Zurig, Switzerland 

A.S. Giolo-Nicollerat 

CERN, Geneva, Switzerland 

M.Zanetti 

Dipartimento di Fisica "Galileo Galilei" and INFN, Padova, Italy 

The top quark and its properties within and beyond the Standard Model will be extensively studied 
at the incoming Large Hadron Collider. Nonetheless the top quark will play the role of the main 
background for most of the Higgs and new physics searches. In this paper the top as a background 
to H — » W + W~ — > l + vl~V Higgs discovery channel will be studied. The current status of the 
Monte Carlo tools for it and single top simulation will be presented. Finally the problem on how 
to evaluate the top background from the data will be addressed and the related systematics will be 
discussed. 



Top2006, Coimbra, Portugal 



* Speaker. 



© Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike Licence. 



http: //pos .sissa.it/ 



Top background to H -> W+W~ -> l + vrv at CMS M.Zanetti 



1. Introduction 

At the Large Hadron Collider (LHC), the production cross section for processes involving the 
top quark, either in the singly and the doubly resonant mode, is foreseen to exceed 1 nb (respec- 
tively ~ 0.3 nb for single top and ~ 0.8 nb for it). At the designed instantaneous luminosity of 
Jzf = 10 34 cm _2 s _1 , averagely more than 10 events containing top per second are expected. In 
this sense the LHC will be a top factory allowing to study with no statistical limitation the prop- 
erties of the most peculiar particle discovered so far. Moreover, providing copiously most of the 
detectors' observables like (b-)jets, leptons, missing transverse energy (£"™ ss ) etc. , top events will 
be a candle for the comprehension of various experimental systematics and detector performances. 
Nevertheless, at the same time, the top quark will play the role of background for most of the Higgs 
and new physics signatures. 

Studying the top quark as a background instead as a signal is clearly a complete different issue 
that needs different experimental and theoretical approaches with consequent specific systematics 
uncertainties. Moreover, the phase space where the top plays the role of the background is usually 
narrow and contains only the tails of the events' distributions. 

In this paper the top background to the H — » VK+W - — » l + vl~v Higgs discovery channel will be 
studied in details for the case of the CMS detector. 

In the first section, a brief overview of the characteristics of the signal and the top background will 
be given. The following part will focus on the simulation of the top events; the generation of tt 
events will be studied by comparing different Monte Carlo programs, then the inclusion of singly 
resonant top production at Next to Leading Order (NLO) will be discussed. 
The LHC energy regime has never been probed so far, thus any sound analysis must rely on mea- 
sured data to the maximum extent. In the second part, the problem on how to normalize the top 
background using data will be addressed. Finally the experimental uncertainties coming from dif- 
ferent normalization strategies will be estimated using a full CMS simulation. 

2. The H -► WW -► £v£v and the top background 

At the LHC, the H — > VK + W~ — ► l + vl~v Higgs decay mode is considered the most favorable 
one for a Higgs mass ranging between 150 — 170 GeV §\§ [^J. This can be argued from the right 
plot in Figure [j], showing the statistical significance foreseen with 30 fb _1 at CMS for the various 
Higgs channels as a function of mg. 

The signal is characterized by two isolated, high pj and nearby leptons with small invariant mass, 
an high value of £"™ ss and no reconstructed jet in the central part of the detector. The main back- 
grounds are the non-resonant W-pair, the Drell Yan, the tt and the single top in the Wt mode 1 . 
The variable that allows to discriminate the irreducible background from continuous W-pair pro- 
duction is the opening angle between the leptons. Due to scalar nature of the Higgs, the W bosons 
are produced mainly with opposite helicity, then, because of the V-A structure of the Ws coupling 
to leptons, the latters are produced nearby in the space. In the left plot of Figure |l] the opening an- 
gle between the leptons are shown for the signal and the qq — > W + W~ process. Since the angular 



'One may think to consider the complete gauge invariant process W + bW b — * l + vbl vb instead of keeping sepa- 
rate it and tWb. The reason why this is not done will be given in section W 
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distributions of the leptons are crucial variables, the top polarization should be taken into account 
throughout its whole decay chain by the Monte Carlo generators. This issue will be addressed in 
33 













-e-- 0.1 








— 


"O 




















z 










■D 








- 


~0.08 








— 


z 








- 












T_ 










0.06 








- 


0.04 








" 


0.02 








- 


0.5 


1.5 


> 2.5 3 





WH.H^bH 

zz-zzz-j rri 




500 700 

m H (GeV/c 2 ) 



Figure 1: Left: opening angle between the leptons (<j>u) f° r the signal (blue) and the W-pairs background 
(black). Right: statistical significance foreseen with 30 fb~ ' at CMS for various Higgs channels as a function 
of mij. 

The most powerful cut to reduce the top events is the jet veto which requires not to have 
reconstructed jets above a certain Ej threshold in the central part of the detector. Dealing with 
low Ej jets in the LHC environment is a delicate issue either from the experimental and from the 
theoretical point of view. More details about this selection will be given in |5T]. When the problem 
of the NLO description of Wt will addressed, it will also be remarked how the application of a jet 
veto helps in consistently separate this process from ft. 







H -► WW (m H = 165 GeV) 


it 


tWb 




axBR(e,jU,T) [fb] 


2360 


86200 


3400 


1) 


Trigger 


1390 (59%) 


57380 (67%) 


2320 (68%) 


2) 


lepton ID 


393 (28%) 


15700 (27%) 


676 (29%) 


3) 


E™ ss > 50 GeV 


274 (70%) 


9332 (59%) 


391 (58%) 


4) 


<fe < 45 


158 (58%) 


1649 (18%) 


65 (17%) 


5) 


12 GeV < mu < 40 GeV 


119(75%) 


661 (40%) 


28 (43%) 


6) 


30 GeV< pf max <55 GeV 


88 (74%) 


304 (46%) 


13 (46%) 


7) 


pf min >25 GeV 


75 (85%) 


220 (73%) 


9.2(71%) 


8) 


Jet veto 


46(61%) 


9.8 (4.5%) 


1.4(15%) 



Table 1: The expected number of events for a luminosity of 1 fb -1 for the signal with a Higgs mass of 
165 GeV and the it and tWb background. The relative efficiency with respect to the previous cut is given 
inside the brackets in percent. 

In Table |l| the complete list of selections used together with the corresponding number of 
events expected for 1 fb -1 for the fully simulated signal (for a Higgs mass of 165 GeV), it and Wt 
are summarized. The rejection for it is i^(10~ 4 ), which sets a challenge for the needed precision 
of the Monte Carlo calculations. Moreover the presence of two neutrinos in the signal final state 
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does not allow the reconstruction of a narrow invariant mass peak; the discovery has then to rely 
on an excess in the expected number of background events. It is thus necessary to identify a phase 
space region where reliably control the contribution of the different backgrounds. In section ||, two 
procedure for the it normalization are proposed and discussed. 

3. it background generation 

In this section the generation of top-pair process (pp — ► tt — > WbWb — > ^v^vbb, with i = e, }X 
and t) will be discussed by comparing four different Monte Carlo (MC) generators. Three points 
will be addressed: first we will estimate how much the Born level description differs from the NLO 
one in the phase space relevant for the Higgs search. Then we will determine whether different 
showering models cause differences in the relevant kinematics distributions. Finally how much the 
inclusion of the spin correlation between the top quarks affects the leptons' angular distribution 
will be estimated. All the following studies have been done at parton level, without exploiting a 
full detector simulation. 

3.1 NLO effects on it simulation 

To estimate the effect of an accurate inclusion of NLO matrix elements, HERWIG 6.508 [^] 
(LO, parton shower Monte Carlo) and MC@NLO 2.31 [Q] (NLO, parton shower Monte Carlo) were 
compared. The spin correlations between the t and t are not considered in MC@NLO. HERWIG 
events were therefore consistently simulated without such correlation. As the same showering 
model is used, the difference between the two simulations should be mostly due to the additional 
NLO matrix elements in MC@NLO. 
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Figure 2: Left: jet multiplicity. Center: leading jet Ej. Right: ttansvers momentum of the tt system. The 
little windows in the center and right plots show the deviation at high pj in logarithmic scale. 

In Figure ||, from left to right, the number of jets, the Ej of the leading jet and the pi of the it 
system are shown 2 . MC@NLO produces in addition to the hard process up to one hard jet whose 
spectrum is accurate at NLO. Most of the jet activity in the events is however dominated by the 
two b-quarks from the two top quarks decay. None of the three distributions shows indeed relevant 
differences between the NLO and LO. Typical NLO effects can be noticed in the high part of the 



"this variable gives an estimation of the spectrum of the additional jets to the hard process 
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spectrum either in the leading jet Ej distribution and in the it system transverse momentum: the 
parton shower is in fact known to describe correctly only the soft part of the extra jet activity. Nev- 
ertheless the region relevant for the H — ► WW — ► £v£v signal selection is the very low p t region, 
where HERWIG and MC@NLO agree very well. In addition, the shapes of all the other cut vari- 
ables are very similar in MC@NLO and HERWIG without spin correlations. 
When comparing the relative efficiencies of the different cuts, the two Monte Carlos differ essen- 
tially only for the jet veto cut, the difference in the efficiency being ^(10%) [|J]. Since the region 
where NLO makes a difference is at very high p t , whereas the bulk of the selected events is in the 
low p t region, it is safe to conclude that NLO effect can be simply included by rescaling the cross 
section by an inclusive factor. 

3.2 Effect of showering models, differences between HERWIG and PYTHIA 

In the following, how different showering models influences the variable shapes and selection 
efficiencies will be studied. For this, PYTHIA 6.325 |jg], based on the Lund hadronization model, 
was compared with HERWIG based on the cluster model for hadronization 3 . For both simulations, 
default scales were chosen. 
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Figure 3: Left: jet multiplicity. Center: tt system pj. Right: pj of the most energetic lepton after the jet 
veto 

The left and central plots of Figure || shows respectively the number of jets and the pj of 
the it system. The two showering models differ remarkably in both distributions, the Lund model 
predicting less and softer extra jets. The effect of these discrepancies in the jet veto efficiencies is 
sizable, i.e. about 20%. 

However, the shapes of the relevant leptons' kinematics distributions in the signal region are very 
similar for the two Monte Carlo's. This can be seen from the right plot of of Figure || showing the 
pj spectrum of the most energetic lepton after applying the jet veto. 



3.3 Effect of the spin correlations 

As explained in § the variable that characterizes more the signal is the opening angle between 
the two leptons. This observable, as well as the mass of the dilepton system and the E™ 



is 



PYTHIA does not take into account the spin correlations between the top and the anti-top , then in order to 
consistently study only the differences caused by the showering model, HERWIG with disabled spin correlations has 
been used 
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sensitive to the spin correlations between the particles involved in the process. In order to point 
out the effect of the inclusion of the top polarization in the it decay chain on the lep tons' angular 
distribution, PYTHIA has been compared with TopREX [|7||. While the former does not consider 
the top spin along its decay, the latter is a matrix element based Monte Carlo describing exactly 
2— >6 processes with LO precision. 
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Figure 4: 0# distribution for TopREX events with and without spin correlations is shown, as well as 
PYTHIA. On the left, only very basic cuts are applied, whereas on the right a jet veto is applied in ad- 
dition. The region important for the Higgs signal search is the low <j>u region. 

In the left plot of Figure ^| showing the angle <j>gg between the leptons a non negligible differ- 
ence between the two Monte Carlo's prediction can be seen. Quantitatively, the variation of the 
efficiency for the (j)^ cut between the PYTHIA and TopREX is ~ 10%. As it can be seen from the 
right plot of Figure |], in the signal region, i.e. after applying the jet veto, the difference between 
the two distributions, even if still present, tends to flat. 



4. Singly resonant top background generation 

Although the Wt mode of the single top production has an estimated cross section one order 
of magnitude times smaller than the it production, the application of the jet veto enhances the 
contribution of the former with respect to the latter's. Considering also that the single top process 
has never been measured at the Tevatron, it is important to pursuit a NLO description of this process 
that allows a precise estimation of its features and its total cross section. In principle one could think 
to use the singly and doubly resonant processes together i.e. to consider pp — > WbWb — » £v£vbb 
which is naturally gauge invariant and describes correctly all the interference terms. Nevertheless 
the NLO corrections are not available for such a process and in particular it is not known how to 
deal with the arising large logarithms of the form \og((m t + mw)/mb). In is therefore preferable to 
view the singly resonant process as one in which a b quark is probed directly inside the proton [||] 
(right diagram in Figure |5]). 

The diagrams contributing to the NLO description of Wt includes the LO doubly resonant it 
(left diagram in Figure |5|). A solution to that has been provided by Campbell and Tramontano in 
Ref. [Q] where they suggest to define a specific Wt final state by imposing a veto on the presence of 
an extra b quark. In this schema, also called "b-PDF approach", the pi threshold for the spectator 
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Figure 5: Examples of Feynman graphs for double (left) and single (right) top production 



b-quark (coming from an initial gluon splitting) is set at the same value of the factorization scale 
used for the PDF. 

This approach perfectly fits with the case of the Wt background for the H — ► W+VK - — > 
l + vl~v searches where a global jet veto is applied. In Ref. [||] the efficiencies for Wt events 
for the signal leptons' selections have been compared between LO plus parton shower (TopREX + 



PYTHIA) and NLO (MCFM [10]) descriptions, showing an overall agreement. After matching the 
veto threshold on the b-quark with the threshold for the jet veto, it is possible to consistently use 
the overall normalization provided by the NLO calculation for the cross section in the signal phase 
space area. The ratio between the LO and NLO cross section goes from 1.4 before the application 
of the cuts, to 0.7 after the selection. This is consistent with the fact that NLO calculations enhances 
the jet activity thus reducing the jet veto efficiency. 



5. tt normalization from data 



The commonly used method to normalize a given background from the data consists on se- 
lecting a signal-free phase space region (control region) where a given background process is en- 
hanced. The contribution of that background in the signal region is then extrapolated from the 
measured amount of events in the control region. This procedure relies on the relation: 

AjMonteCarlo 
M _ signal _reg M _ G signal _reg ' ^-signal _reg AJ « ,x 

M signal _reg — MonteCarlo ^ control _reg— ^control_reg JJ--U 

lv — *„.,; V control res ' ^control ree 



\jMontetarlc 
control _reg 



'control_reg ' &control_reg 



where N^""^ c " e r '° and N^' r e f a ^" are the numbers of events predicted by the Monte Carlo simu- 
lation in the signal and control region. Each of this two numbers can be expressed as a product of 
the theoretical cross section in that phase space area, <y S ignai_reg,control_reg> an d the experimental ef- 
ficiency of reconstructing events in the same region, £ s ignai_reg.controi_reg 4 ■ This will allow to better 
point out the different sources of systematic uncertainties. In particular the theoretical predictions 
enter the procedure only via the ratio <J s ignai_reg I ' Gcontrol_reg> leading to a much smaller scale de- 
pendency and thus to smaller theoretical uncertainties. 

The theoretical issues concerning the it normalization have been deeply studied in [|I3|], following 
the work done in the 2003 Les Houches Workshop. The primary goal here is to provide a reliable 



The experimental uncertainties could modify the boundaries defining the phase space where the cross section is 
calculated theoretically. This is the case in particular when the selections involve jets. The "e" terms in relation (5.1) are 
assumed to account also for this effect. 
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description of the experimental aspects, specifically the ones related to the CMS detector. For this 

study a full detector simulation has then been exploited. 

The main requirement from the experimental side on the choice of the control region is to limit as 



much as possible the error due to the "e" terms in relation (p.l|). This implies to use similar selec- 
tions as for the signal region. Moreover the contamination from other physical and instrumental 
backgrounds should be negligible. 

In the following the signal phase space area and two possible control regions for it normalization 
are described, focusing on the related experimental issues. Each of these phase space regions (ei- 
ther the signal and the control ones) is defined by the selections on the leptons listed in Table |l] 
(items 1-8). 

5.1 Signal region 

We already stated before that the signal region is defined by requiring not to have reconstructed 
jet above a certain Ej threshold in the central part of the detector. Clearly the lower the Ej thresh- 
old, the higher is the rejection for tt background events. At CMS the definition of a jet at low 
Ei is experimentally problematic. This because the 4 Tesla magnetic field of the CMS solenoid 
spread the jet constituents in the transverse plane and prevents the low momentum charged tracks 
produced during the fragmentation even to reach the calorimeters. Moreover the LHC, in addition 
to the products of the hard scattering, produces thousands of charged and neutral particles some of 
which may have rather high pj, thus enhancing the jets fake rate. 

In order to avoid a high rate of fake jets at low Ej, the tracking measurements are exploited in the 
jet definition. For jets between 15 and 20 GeV the sum of the pj of the tracks belonging to the jets 
(i.e. the tracks coming from the primary vertex which stand within the jet cone) is required to be 
at least the 20% of the jet E T @|. 
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Figure 6: Left: a distribution for matched and un-matchedjets. Right: fraction of matched jets as a function 
of the reconstructed jet Ej 

As it can be seen from the right plot in Figure ^, the fake jets (i.e. not matched with any jet at 
parton level) rate below 20 GeV decreases remarkably. The distribution of a = Lpj (tracks) jEj (jet) 
from matched and un-matched jets is shown in the left plot of Figure ^. 

5.2 b-tagging jets based it control region 

The request for two b-tagged jets is the most natural for the definition of a control region for 
the it background. In this study, the algorithm used to discriminate whether a jet is originated from 
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a ft quark is based on the impact parameters of charged particle tracks associated to the jet [ 1 1 ] . The 
parameter that characterizes the efficiency and the mistagging rate of the algorithm is the impact 
parameter significance (Gjp) of a minimum number of tracks associated to the jet. In this study a jet 
is tagged as a ft-jet if its measured E t is greater then 20 GeV and if there are at least 2 tracks whose 
dp is greater than 2. In this case the double b-tagging efficiency is ^(30%) while the mistagging 
rate is £?(3%). Table ^summarizes the number of events expected for 10 fb -1 in the control region 
for it, Wt and the signal in the case of 2\i, 2e and e\i final states. 

Not all the processes with 21 + 2b + Ep iss as final state have been fully simulated for this anal- 
ysis, nevertheless general considerations and fast Monte Carlo level checks can lead to exclude 
other relevant sources of backgrounds. 

The more natural concurrent process is the not resonant W + VK~ —>2£ + bb which is anyway sup- 
pressed with respect to it. Its cross section is indeed expected to be smaller than 1 pb. Assuming 
the same efficiency for the kinematic selections as for the W + W~ — ► 2£, i.e. ^?(10~ 3 ), less than 
10 events are aspected for 10 fb~ x in the control region even without folding the double-b tagging 
efficiency. In the case of same flavor leptons in the final state, y* jZ* —>2£ + bb (the vector boson 
mass being away from the Z peek, i.e. mu < 40 GeV) could also contribute as an instrumental 
background, when an high value of E" ms is provided by the not full hermeticity of the detec- 
tor and/or due the finite resolution of the calorimeters. Anyway for a fully simulated sample of 
f/Z* -^2£ + 2b with jets' E t greater than 20 GeV, the fraction of events with Ef ss > 50 GeV (the 
actual cut applied for the signal selection) is ^(10 _2 ). Applying the same kinematic selections, 
but the Ef hs cut on a. pp^ f/Z* -^2£ + bb sample generated with MadGraph Monte Carlo [JTJ], 
200 events are expected for 10 fb~ l , which reduce to a negligible quantity if the rejection due to a 
realistic E" ms selection is included 5 . 

5.3 Two high E, jets based it control region 

In order to avoid the systematics due to the b-tagging algorithm it is worth to have alternative 
methods to estimate the it background from data. Each of the two ft's in the it final state come from 
a 175 GeV central object; their E t spectra are then rather hard. An alternative method to define a 
it control region is thus simply to require, in addition to the signal kinematic cuts listed in Table [j], 
two hard jets in the detector. 

In order to avoid the contamination from Drell Yan which in the case of 21 + 2j final state has a 
much higher cross section then the 2£ + 2b one, only e\i final state has been considered. 
The thresholds on the jets' transverse energy that maximize the signal (it) over the background 
(Wt+signal) ratio and minimize the statistical error have been found to be 50 and 30 GeV. The 
number of events expected events for 10 fb _1 for it, Wt and the signal are summarized in Table ||. 

A background process not considered in the full simulation analysis is W + W~ — » /iv^ +ev e + 
2j. The cross section, after geometrical acceptance cuts, is 0.4 pb, whereas the signal selection cuts 
efficiency resulted to be smaller than 5 • 10 (with a statistical error of ~ 8%). The contribution of 
this background can then be assumed to be at maximum of the order as the signal. 



5 In the y* /Z* — > 2£ + 2b fully simulated sample (the only one available) the bb pair comes only from a gluon 
splitting, the main mechanism of f/Z* +2b not being included. That is the reason why the selection cuts have been 
applied at parton level on a MadGraph sample 
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"b-tagging" control region 


"hard jets" control region 


Signal region 


2\i 


2e 


ejl 


2\i 


2e 


ejl 


2\x 


2e 


ejl 


it 


194 


107 


245 


- 


- 


411 


33 


22 


44 


Wt 


1 


<1 


2 


- 


- 


6 


5 


3 


6 


Signal [piji = 165) 


<1 


<1 


1 


- 


- 


11 


156 


89 


214 



Table 2: Number of events of tt, Signal and Wt expected for 10 fb ' in the two control regions described 
above and in the signal region. Results are shown for 2ix, 2e, e\i final states. 

In the case one jet is misidentified as an electron, W ± — > jiv^ +3j, could be a source of background 
too. At CMS, the probability of electron misidentification is estimated to be <^(10~ 4 ) 6 . Given its 
cross section, calculated to be ~ 200 pb after the geometrical acceptance cuts, the latter rejection 
factor together with the kinematic selection efficiency -estimated again from a generator level study 
to be <^(10~ 4 )- lead to neglect this process as a source of contamination of the tt control region. 

6. tt normalization procedure uncertainties 

Our proposed procedures to estimate the number of tt events in the signal phase space region 
exploits relation ( |5.1| ). In order to compute the systematic uncertainties on the final result we 
consider separately those related to each term present in the formula. 

Theoretical uncertainty. 

Taking the ratio of the tt cross sections in the signal and control region avoids much of 



the theoretical systematic uncertainties. In Ref. [ ]13| ] the theoretical uncertainty on the ratio 
'signal _reg i / o 'control _reg has been studied at parton level with LO precision by varying the renor- 
malization and factorization scale. The error has been estimated to range between 3% to 10%, 
mostly due to the choice of the PDF. For what it has been shown before, the theoretical error can 
be larger because of other factors, mainly the parton shower model. A 10% systematical error due 



to theoretical uncertainty will be assumed as reported in Ref. [13], although baring in mind that 
this could be an optimistic estimation. 

Jet Energy Scale uncertainty. 

In the background normalization procedures we proposed, the jet energy scale (JES) uncer- 
tainty is particularly important since it affects in opposite manners the signal region, defined by 
vetoing the jets, and the control region where the presence of two jets is required. To take into 
account this sort of anticorrelation of e s i gna i_ reg and E contro i_ reg , we estimate the effect of the JES 
uncertainty directly on their ratio by rescaling the measured jet four momentum by an amount cor- 
responding to the percentual uncertainty (i.e. P^ et = (1 +X)P^ et ). 
In the plot of Figure (f7b the relative variation of " s " n/ -"' s for various values of A is shown. In the 

* — * ^-control reg 

plot the triangles represent the control region defined by requiring two jets with E, greater then 50 



The muon misidentification rate is at least one order of magnitude smaller 
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and 35 GeV, whereas the squares stand for the control region defined by requiring two b-tagged 
jets 7 . A realistic estimation of the JES uncertainty at CMS after integrating 10 fb~ { of LHC is 
The the corresponding relative variation of £ s ignai_reg/£controi_reg is ~ 8% for the double 



b-tagging defined control region and ~ 10% for the two high E t jets control region. 
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Figure 7: Relative variation of 



^signal reg 



as a function the jet momentum rescaling factor (A). The red 



^control reg 

triangles represent the control region defined by two hard jets whereas the black squares correspond to the 
two b-tagged jets phase space area. 

a criterion uncertainty. 

To estimate the systematic uncertainty due to a criterion, the value of the cut has been varied 
from 0.15 to 0.25. Moreover different values of the minimum p t for a track to be included in the 
sum have been tried, from 2 to 3 GeV. The consequent variation of the jet veto efficiency (£ s ignai_reg) 
is relatively small, i.e. of the order of 4%. 

b-Tagging uncertainty. 



In Ref. [14] the precision with which the b-tagging efficiency will be known at CMS is ex- 
pected to be 11% for 1 fb integrated luminosity and it is foreseen to improve till 7% with 
10 fb . These values represent directly the uncertainty on £ CO ntrol_reg m the case of the control 
region defined by requiring two b-tagged jets. 

Uncertainties on N contro i_ reg . 

It has been shown in the previous section that it is plainly the dominant process in both the 
proposed control regions. In the worst case, i.e. when the control region is defined by two high 
E t jets, the fraction of events coming form other processes is smaller than 4%. Provided that this 
fraction is small, it is safe to simply neglect this source of systematic. 

For 10 fb~ l the experimental uncertainties listed above accounts for a systematic error of 
~ 11% for both the it control regions. Including the theoretical uncertainty this error does not 
exceed 16%. 



7 The reason way the ratio £ s i gna i regl ^control reg m tne latter case is less sensitive to the JES uncertainty is that the 
E, threshold for the b-jets candidates is 20 GeV and the fraction of it events with b-tagged jets with E, close to that 
threshold is very small. 
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Statistical uncertainties 

The statistical precision with which the number of it events in the signal region can be known 
depends on the expected number of it events in the control region. From the numbers quoted in 
Table ^, assuming a poissonian behavior, it is clear that the error due to systematic uncertainties is 
predominant with respect to the statistical ones for both the proposed normalization procedures. 

7. Conclusions 

The searches for the Higgs boson at the LHC via the channel H — ► W + W — ► l + vl~v offers 
the possibility to study the it and Wt in a peculiar phase space region. This represents a major 
challenge either from the theoretical and from the experimental point of view. 
The present status of the Monte Carlo tools for it simulation has been discussed. The comparison 
between a set of generators shows that the NLO effects can be safely accounted as a global rescaling 
of the total cross section. On the contrary the PYTHIA and HERWIG showering models differ in 
predicting jets multiplicity and energy spectra. Finally the spin correlation between the top-pair 
induces a variation of ^(10%) in the leptons' selection cuts efficiency. In section |]it has been 
show that the Wt process can be reliably calculated at NLO in the signal region defined by a jet 
veto without double counting with it. 

Finally the normalization of the it background have been discussed. Two control regions have been 
proposed, one based on b-tagging the jets coming from the top-pair and the other by requiring two 
high Ej jets. Both approaches provide a reliable phase space area dominated by it events and lead 
to an overall systematic uncertainty of 16%. 
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